From visuo-motor to language
Authors
Abstract
We propose a learning agent that first learns concepts in an integrated, cross-modal manner, and then uses these concepts as the semantic model onto which language is mapped. We study the action of throwing, treating the whole trajectory as a single image. A large set of such images and the corresponding throwing parameters are mapped jointly onto a low-dimensional non-linear manifold. Such models improve with practice, and can be used as the starting point for real-life tasks such as aiming (e.g. dart throwing). How can such models be used in learning language? We consider a set of videos involving throwing and rolling actions. These actions are analyzed into a set of contrastive semantic classes based on the agent, the action, and the thrown object (trajector). We obtain crowdsourced commentaries on these videos (unannotated text) from a number of adults. A learner system attempts to learn labels using the contrastive probabilities for a given semantic class. Only a handful of high-confidence words are found, but the agent starts off with this partial knowledge. These words are used to learn a potential set of syntactic patterns, for example for the trajector, and then for agent-trajector-action sentences. We demonstrate how this may work for two completely different languages, English and Hindi, and also show how rudiments of agreement, synonymy and polysemy are detected.
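The abstract does not say how the contrastive probabilities are computed. As a rough illustration only, the sketch below assumes one plausible reading: each word is scored by the ratio of its smoothed probability in commentaries for the target semantic class to its probability in commentaries for all other classes. The function name `contrastive_scores`, the add-one smoothing, and the toy commentaries are assumptions made for this example and are not taken from the paper.

```python
from collections import Counter


def contrastive_scores(commentaries, target_class, smoothing=1.0):
    """Score words by P(word | target class) / P(word | other classes).

    `commentaries` is a list of (semantic_class, text) pairs.
    The ratio and the additive smoothing are illustrative assumptions,
    not the scoring rule used in the paper.
    """
    in_counts, out_counts = Counter(), Counter()
    for label, text in commentaries:
        words = text.lower().split()
        (in_counts if label == target_class else out_counts).update(words)

    vocab = set(in_counts) | set(out_counts)
    # Smoothed totals so that unseen words never give a zero denominator.
    in_total = sum(in_counts.values()) + smoothing * len(vocab)
    out_total = sum(out_counts.values()) + smoothing * len(vocab)

    scores = {}
    for word in vocab:
        p_in = (in_counts[word] + smoothing) / in_total
        p_out = (out_counts[word] + smoothing) / out_total
        scores[word] = p_in / p_out
    return scores


# Invented toy commentaries, labelled by the trajector shown in the video.
toy = [
    ("ball", "the boy throws the ball"),
    ("ball", "he rolls the ball"),
    ("ball", "the girl throws a ball"),
    ("stone", "the boy throws the stone"),
    ("stone", "she throws a stone"),
]

scores = contrastive_scores(toy, "ball")
best_word = max(scores, key=scores.get)
print(best_word, round(scores[best_word], 2))
```

On this toy data the word "ball" receives the highest score, while frequent words that occur in both classes, such as "the" and "throws", score close to 1. This is consistent with the abstract's remark that only a handful of high-confidence words emerge at first.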
Similar resources
Discovering a Language for Human Activity
We present a roadmap to a Human Activity Language (HAL) for symbolic manipulation of visual and motor information in a sensory-motor system model. The visual perception subsystem translates a visual representation of action into our visuo-motor language. One instance of this perception process could be achieved by a Motion Capture system. We captured almost 90 different human actions in order t...
Understanding visuo-motor primitives for motion synthesis and analysis
The problem addressed in this paper concerns the representation of human movement in terms of atomic visuo-motor primitives considering both generation and perception of movement. We introduce the concept of kinetology, the phonology of human movement, and five principles on which such a system should be based: compactness, view-invariance, reproducibility, selectivity, and reconstructivity. We...
Grounded Representations for Sensory-Motor Learning: A Linguistic Approach
We have empirically discovered that the space of human actions has a linguistic structure. This is a sensory-motor space consisting of the evolution of the joint angles of the human body in movement. The space of human activity has its own phonemes, morphemes, words (nouns, verbs, adjectives, adverbs), and sentences formed by syntax. This has a number of implications for the grounding problem a...
Presupplementary motor area activation during sequence learning reflects visuo-motor association.
In preceding studies (Hikosaka et al., 1996; Sakai et al., 1998) we have shown that the presupplementary motor area (pre-SMA), an anterior part of the medial premotor cortex, is active during visuo-motor sequence learning. However, the paradigm required the subjects first to acquire correct visuo-motor association and then to acquire correct sequence, and it was still unknown which of the two p...
Effects of retinal position on the visuo-motor adaptation of visual stability in a virtual environment
Although the retinal image changes a great deal with the movement of our head or eyes, we perceive a stable world (a phenomenon known as visual stability or position constancy). Visual stability adaptively changes for each new combination of vision and head motion, or to compensate for manipulated visuo-motor gain. This study aims to investigate the effects of retinal positions on visuo-motor a...
Haptics in teaching handwriting: the role of perceptual and visuo-motor skills.
Two studies were carried out in order to better understand the role of perceptual and visuo-motor skills in handwriting. Two training programs, visual-haptic (VH) and visual (V), were compared which differed in the way children explored the letters. The results revealed that improvements of VH training on letter recognition and handwriting quality were higher than improvements after V training....